GTI at SemEval-2016 Task 4: Training a Naive Bayes Classifier using Features of an Unsupervised System
Abstract
This paper presents the approach of the GTI Research Group to SemEval-2016 task 4 on Sentiment Analysis in Twitter, specifically subtasks A (Message Polarity Classification), B (Tweet classification according to a two-point scale) and D (Tweet quantification according to a two-point scale). We followed a supervised approach in which features are extracted by a dependency parsing-based method that uses a sentiment lexicon and Natural Language Processing techniques.
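As a rough illustration of the idea in the abstract and title (lexicon-derived features fed to a Naive Bayes classifier), the following is a minimal sketch and not the GTI system itself. The toy lexicon, the feature names, the training tweets and the `NaiveBayes` class are all hypothetical. Negation is handled with a simple "flip the next sentiment word" rule, which stands in for the dependency parsing the abstract mentions.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy lexicon; the lexicon used in the paper is not given here.
LEXICON = {"good": 1, "great": 1, "love": 1, "bad": -1, "awful": -1, "hate": -1}
NEGATORS = {"not", "never", "no"}


def extract_features(tweet):
    """Map a tweet to discrete polarity features.

    Assumption: a negator flips the polarity of the token right after it.
    This adjacency rule is a stand-in for dependency-based scope detection.
    """
    feats = []
    negate = False
    for tok in tweet.lower().split():
        tok = tok.strip(".,!?")
        if tok in NEGATORS:
            negate = True
            continue
        score = LEXICON.get(tok, 0)
        if score:
            if negate:
                score = -score
            feats.append("pos_word" if score > 0 else "neg_word")
        negate = False
    return feats or ["no_sentiment_word"]


class NaiveBayes:
    """Multinomial Naive Bayes over feature lists, with add-alpha smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha

    def fit(self, feature_lists, labels):
        self.class_counts = Counter(labels)
        self.feat_counts = defaultdict(Counter)
        self.vocab = set()
        for feats, label in zip(feature_lists, labels):
            self.feat_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, feats):
        total = sum(self.class_counts.values())
        best_label, best_logp = None, -math.inf
        for label, count in self.class_counts.items():
            logp = math.log(count / total)  # log prior
            denom = sum(self.feat_counts[label].values()) + self.alpha * len(self.vocab)
            for f in feats:
                logp += math.log((self.feat_counts[label][f] + self.alpha) / denom)
            if logp > best_logp:
                best_label, best_logp = label, logp
        return best_label


# Hypothetical two-class training data (two-point scale, as in subtask B).
train = [
    ("I love this great phone", "positive"),
    ("awful service, I hate it", "negative"),
    ("not good at all", "negative"),
    ("never bad, always great", "positive"),
]
model = NaiveBayes().fit(
    [extract_features(text) for text, _ in train],
    [label for _, label in train],
)

print(model.predict(extract_features("not bad")))
print(model.predict(extract_features("I hate this")))
```

The point of the sketch is the division of labour: a rule-based, unsupervised step turns raw text into a small set of polarity features, and the supervised classifier only has to learn how those features relate to the labels.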
Similar resources
SteM at SemEval-2016 Task 4: Applying Active Learning to Improve Sentiment Classification
This paper describes our approach to the SemEval 2016 task 4, “Sentiment Analysis in Twitter”, where we participated in subtask A. Our system relies on AlchemyAPI and SentiWordNet to create 43 features based on which we select a feature subset as final representation. Active Learning then filters out noisy tweets from the provided training set, leaving a smaller set of only 900 tweets which we ...
UCD-FC: Deducing semantic relations using WordNet senses that occur frequently in a database of noun-noun compounds
This paper describes a system for classifying semantic relations among nominals, as in SemEval task 4. This system uses a corpus of 2,500 compounds annotated with WordNet senses and covering 139 different semantic relations. Given a set of nominal pairs for training, as provided in the SemEval task 4 training data, this system constructs for each training pair a set of features made up of relat...
JU_NLP at SemEval-2016 Task 11: Identifying Complex Words in a Sentence
The complex word identification task refers to the process of identifying difficult words in a sentence from the perspective of readers belonging to a specific target audience. This task has immense importance in the field of lexical simplification. Lexical simplification helps in improving the readability of texts consisting of challenging words. As a participant of the SemEval-2016: Task 11 s...
SWASH: A Naive Bayes Classifier for Tweet Sentiment Identification
This paper describes a sentiment classification system designed for SemEval-2015, Task 10, Subtask B. The system employs a constrained, supervised text categorization approach. Firstly, since thorough preprocessing of tweet data was shown to be effective in previous SemEval sentiment classification tasks, various preprocessing steps were introduced to enhance the quality of lexical informati...
Tor, TorMd: Distributional Profiles of Concepts for Unsupervised Word Sense Disambiguation
Words in the context of a target word have long been used as features by supervised word-sense classifiers. Mohammad and Hirst (2006a) proposed a way to determine the strength of association between a sense or concept and co-occurring words (the distributional profile of a concept, DPC) without the use of manually annotated data. We implemented an unsupervised naïve Bayes word sense classifier u...